All Greek to me! An automatic Greeklish to Greek transliteration system

نویسندگان

  • Aimilios Chalamandaris
  • Athanassios Protopapas
  • Pirros Tsiakoulis
  • Spyros Raptis
چکیده

This paper presents research on “Greeklish,” that is, a transliteration of Greek using the Latin alphabet, which is used frequently in Greek e-mail communication. Greeklish is not standardized and there are a number of competing conventions co-existing in communication, based on personal preferences regarding similarities between Greek and Latin letters in shape, sound, or keyboard position. Our research has led to the development of “All Greek to me!,” the first automatic transliteration system that can cope with any type of Greeklish. In this paper we first present previous research on Greeklish, describing other approaches that have attempted to deal with the same problems. We then provide a brief description of our approach, illustrating the functional flowchart of our system and the main ideas that underlie it. We present measures of system performance, based on about a year’s worth of usage as a public web service, and preliminary research, based on the same corpus, on the use of Greeklish and the trends in preferred Latin-Greek letter mapping. We evaluate the consistency of different transliteration patterns among users as well as the within-user consistency based on coherent principles. Finally we outline planned future research to further understand the use of Greeklish and improve “All Greek to me!” to function reliably embedded in integrated communication platforms bridging e-mail to mobile telephony and ubiquitous connectivity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Random Forests Text Transliteration System for Greek Digraphia

Greeklish to Greek transcription does undeniably seem to be a challenging task since it cannot be accomplished by directly mapping each Greek character to a corresponding symbol of the Latin alphabet. The ambiguity in the human way of Greeklish writing, since Greeklish users do not follow a standardized way of transliteration makes the process of transcribing Greeklish back to Greek alphabet ch...

متن کامل

Querying the Greek Web in Greeklish

In this paper, we experimentally study the problem of querying the web in a hybrid language, namely Greeklish. Greeklish is the transliteration of Greek in Latin characters of the ASCII code. Although Greeklish emerged as a convenient mean for the creation and distribution of digital data at a time when Unicode Transformation Format was not supported for the Greek alphabet, nevertheless it is s...

متن کامل

gr2ǫλ: A Greeklish-to-Greek converter

Greeklish is a transliteration of the Greek language written using Roman characters. This phenomenon started in the 1980’s, when the Greek language was unfortunately covered by multiple ASCII extensions (codepages) which were incompatible. This lead to communication problems, with users being forced to guess the correct encoding of every message, document and webpage. Making matters worse, publ...

متن کامل

Text segmentation for Language Identification in Greek Forums

In this paper, we examine the benefit of applying text segmentation methods to perform language identification in forums. The focus here is on forums containing a mixture of information written in Greek, English as well as Greeklish. Greeklish can be defined as the use of Latin alphabet for rendering Greek words with Latin characters. For the evaluation, a corpus was manually created by collect...

متن کامل

Learning the Greek Language via Greeklish

Learning Greek as a second or foreign language has drawn the attention of many researchers throughout time. A dictionary is amongst the first things a foreign language student uses. Reading comprehension is significantly improved by the use of a dictionary, especially when this includes the way words are pronounced. We developed a assistance software for learning the Greek Language via Greeklis...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006